Corpus: deu_news_2000_100K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 88 99 99 99 99
1000 795 982 999 999 999
10000 5703 9375 9928 9972 9983
100000 33440 82331 96986 99390 99826
1000000 33441 82332 96987 99391 99827


Zipf's diagram for sentence endings


Gnuplot diagram

8978 msec needed at 2021-05-10 20:04